Arabic Text Detection in News Video Based on Line Segment Detector

Authors

  • Sadek Mansouri
  • Mbarek Charhad
  • Mounir Zrigui
Abstract

Text embedded in video sequences is very important for semantic indexing and content-based retrieval systems, especially for large-scale news collections. However, its detection and extraction remain an open problem because of the variety of text sizes and the complexity of backgrounds. In this paper, we propose an approach for automatic Arabic text localization based on a novel method for text-line detection. In the first stage, we use a line segment detector to detect candidate text lines. Then, we propose a word segment identification algorithm, based on features specific to Arabic text, to remove non-text lines. The last stage concerns text line estimation and text detection in video frames. Experimental results on a large collection of video images taken from news broadcasts show the excellent performance of our approach for detecting text with different character sizes, directions and styles, even against complex image backgrounds.
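The abstract does not give implementation details, so the following is only a minimal sketch of how the output of a line segment detector might be grouped into candidate text lines. It assumes the segments are already available as (x1, y1, x2, y2) tuples and that captions are roughly horizontal. The function names, the angle and distance thresholds, and the grouping rule are all hypothetical illustrations, not the authors' algorithm.

```python
import math
from typing import List, Tuple

# A segment is (x1, y1, x2, y2) in pixel coordinates. In practice these would
# come from a line segment detector; here they are plain tuples (assumption).
Segment = Tuple[float, float, float, float]
Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def is_near_horizontal(seg: Segment, max_angle_deg: float = 10.0) -> bool:
    """Return True if the segment deviates from horizontal by at most max_angle_deg."""
    x1, y1, x2, y2 = seg
    angle = abs(math.degrees(math.atan2(y2 - y1, x2 - x1)))
    # Treat left-to-right and right-to-left segments the same way.
    angle = min(angle, 180.0 - angle)
    return angle <= max_angle_deg


def group_into_candidate_lines(
    segments: List[Segment],
    max_angle_deg: float = 10.0,  # hypothetical threshold
    y_tol: float = 5.0,           # hypothetical vertical tolerance in pixels
    min_segments: int = 2,        # hypothetical minimum support per line
) -> List[Box]:
    """Group near-horizontal segments with similar height into bounding boxes."""
    horizontal = [s for s in segments if is_near_horizontal(s, max_angle_deg)]
    # Sort by vertical midpoint so neighbouring segments end up adjacent.
    horizontal.sort(key=lambda s: (s[1] + s[3]) / 2.0)

    groups: List[List[Segment]] = []
    group_y = 0.0  # running mean of the current group's vertical midpoints
    for seg in horizontal:
        mid_y = (seg[1] + seg[3]) / 2.0
        if groups and abs(mid_y - group_y) <= y_tol:
            groups[-1].append(seg)
            group_y = sum((s[1] + s[3]) / 2.0 for s in groups[-1]) / len(groups[-1])
        else:
            groups.append([seg])
            group_y = mid_y

    boxes: List[Box] = []
    for group in groups:
        if len(group) < min_segments:
            continue  # isolated segments are unlikely to form a text line
        xs = [x for s in group for x in (s[0], s[2])]
        ys = [y for s in group for y in (s[1], s[3])]
        boxes.append((min(xs), min(ys), max(xs), max(ys)))
    return boxes
```

Calling `group_into_candidate_lines` on a list of detected segments returns one bounding box per group of aligned segments. Filtering non-text lines and estimating the final text regions, which the abstract describes as later stages, are not covered by this sketch.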


Similar resources

Detection and Localization of Farsi/Arabic Text in Video Images

Video text detection plays an important role in applications such as semantic-based video analysis, text information retrieval, archiving and so on. In this paper, we propose a Farsi/Arabic text detection approach. First, edges are extracted with an appropriate edge detector, and then artificial corners are extracted using the edge cross points. Artificial corner histogram analysis is done for ...


Arabic News Articles Classification Using Vectorized-Cosine Based on Seed Documents

Besides its own merits, text classification (TC) has become a cornerstone in many applications. Work presented here is part of and a prerequisite for a project we have undertaken to create a corpus for Arabic text processing. It is an attempt to create modules automatically that would help speed up the process of classification for any text categorization task. It also serves as a tool for...


Precise News Video Text Detection/Localization Based on Multiple Frames Integration

This paper presents a multiple frames integration based approach to detect and localize static caption texts on news videos. Utilizing the temporal information of videos, the algorithm includes robust text features and the non-text line deletion technique, and yields precise and tight localization for detected text regions. The Canny edge detector is first applied on reference frames and is fol...


Document Analysis And Classification Based On Passing Window

In this paper, we present a document analysis and classification system to segment and classify the contents of Arabic document images. This system includes preprocessing, document segmentation, feature extraction and document classification. A document image is enhanced in the preprocessing stage by removing noise, binarization, and detecting and correcting image skew. In document segmentation, an algorith...


Cross-Lingual Retrieval of Identical News Events by Near-Duplicate Video Segment Detection

Recently, for reusing large quantities of accumulated news video, technology for news topic searching and tracking has become necessary. Moreover, since we need to understand a certain topic from various viewpoints, we focus on identical event detection in various news programs from different countries. Currently, text information is generally used to retrieve news video. However, cross-lingual...



Journal:
  • Research in Computing Science

Volume 132

Publication date: 2017